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EXPLICIT BOUNDS FOR THE PSEUDOSPECTRA OF VARIOUS 
CLASSES OF MATRICES AND OPERATORS 

FEIXUE GONG\ OLIVIA MEYERSON^, JEREMY MEZA^, MIHAI STOIGIU'^, ABIGAIL 

WARD® 


Abstract. We study the e-pseudospectra o'g(A) of square matrices A e . 

We give a complete characterization of the e-pseudospectrum of any 2x2 matrix 
and describe the asymptotic behavior (as e -»■ 0) of a^(A) for any square matrix 
A. We also present explicit upper and lower bounds for the e-pseudospectra 
of bidiagonal matrices, as well as for finite rank operators. 


1. Introduction 


The pseudospectra of matrices and operators is an important mathematical object 
that has found applications in various areas of mathematics: linear algebra, func¬ 
tional analysis, numerical analysis, and differential equations. An overview of the 
main results on pseudospectra can be found in [5]. 


In this paper we describe the asymptotic behavior of the e-pseudospectrum of any 
n X n matrix. We apply this asymptotic bound and additionally provide explicit 
bounds on their e-pseudospectra to several classes of matrices and operators, in¬ 
cluding 2x2 matrices, bidiagonal matrices, and finite rank operators. 


The paper is organized as follows: in Sectionj^we give the three standard equivalent 
definitions for the pseudospectrum and we present the “classical” results on e- 
pseudospectra of normal and diagonalizable matrices (the Bauer-Fike theorems). 
Section contains a detailed analysis of the e-pseudospectrum of 2 x 2 matrices, 
including both the non-diagonalizable case (Subsection |3.1[ ) and the diagonalizable 
case (Subsection |3.2[ ). The asymptotic behavior (as £ -»• 0) of the £-pseudospectrum 
of any n x n matrix is described in Section]^ where we show (in Theorem 4.21 that, 
for any square matrix A, the £-pseudospectrum converges, as e ^ 0 to a union of 
disks. We apply the main result of Section|^to several classes of matrices: matrices 
with a simple eigenvalue, matrices with an eigenvalue with geometric multiplicity 
1,2x2 matrices, and Jordan blocks. 


Section is dedicated to the analysis of arbitrary periodic bidiagonal matrices A. 
We derive explicit formulas (in terms the coefficients of A) for the asymptotic radii, 
given by Theorem |4.2[ of the £-pseudospectrum of A, as £ ^ 0. In the last section 
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(Section]^ we consider finite rank operators and show that the £-pseudospectrum 
of an operator of rank m is at most as big as Ce ^, as £ ^ 0. 


2. Pseudospectra 


2.1. Motivation and Definitions. The concept of the spectrum of a matrix A e 
f^NxN provides a fundamental tool for understanding the behavior of A. As is well- 
known, a complex number z e C is in the spectrum of A (denoted cr(A)) whenever 
zl - A (which we will denote as z - A) is not invertible, i.e., the characteristic 
polynomial of A has z as a root. As slightly perturbing the coefficients of A will 
change the roots of the characteristic polynomial, the property of “membership 
in the set of eigenvalues” is not well-suited for many purposes, especially those in 
numerical analysis. We thus want to find a characterization of when a complex 
number is close to an eigenvalue, and we do this by considering the set of complex 
numbers z such that || (z- A)~^ || is large, where the norm here is the usual operator 
norm induced by the Euclidean norm, i.e. 

||A|| = sup ||A?;||. 

II’'11=1 

The motivation for considering this question comes from the observation that if 
z„ is a sequence of complex numbers converging to an eigenvalue A of A, then 
||(z„-A)“^|| ^ oo as n ^ oo. We call the operator {z-A)~^ the resolvent of A. The 
observation that the norm of the resolvent is large when z is close to an eigenvalue 
of A leads us to the first definition of the £-pseudospectrum of an operator. 

Definition 2 . 1 . Let A 6 , and let £ > 0. The £-pseudospectrum of A is the 

set of z 6 C such that 

||(z-A)-1||>1/£ 


Note that the boundary of the £-pseudospectrum is exactly the 1/e level curve of 
|(z-A)-i|| ~ 


the function z ^ 
the eigenvalues. 


Fig. 


2.1 


depicts the behavior of this function near 



Figure 2.1. Contour Plot of Resolvent Norm 

The resolvent norm has singularities in the complex plane, and as we approach these 
points, the resolvent norm grows to infinity. Conversely, if ||(z-A)“^|| approaches 
infinity, then z must approach some eigenvalue of A [i Thm 2.4]. 
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(It is also possible to develop a theory of pseudospectrum for operators on Banach 
spaces, and it is important to note that this converse does not necessarily hold for 
such operators; that is, there are operators [m such that \{z-A) ^|| approaches 
infinity, but z does not approach the spectrum of A.) 

The second and third definitions of the e-pseudospectrum arise from eigenvalue 
perturbation theory [^. 

Definition 2.2. Let A € . The e-pseudospectrum of A is the set of z e C 

such that 

0 € a{A + E) 

for some E with ||_B|| < e. 

Definition 2.3. Let A € The e-pseudospectrum of A is the set of z e C 

such that 

||(z - ^)n|| < e 

for some unit vector v. 

The third definition is similar to our first definition in that it quantifies how close 
z is to an eigenvalue of A. In addition to this, it also gives us the notion of an 
e-pseudoeigenvector. 

Theorem 2.1 (Equivalence of the definitions of pseudospectra). For any matrix 
A 6 , the three definitions above are equivalent. 

The proof of this theorem is given in [U §2]. As all three definitions are equivalent, 
we can unambiguously denote the e-pseudospectrum of A as ag(A). 

Fig. |2.2| depicts an example of e-pseudospectra for a specific matrix and for var¬ 
ious e. We see that the boundaries of e-pseudospectra for a matrix are curves 
in the complex plane around the eigenvalues of the matrix. We are interested in 
understanding geometric and algebraic properties of these curves. 



Figure 2.2. The curves bounding the £-pseudospectra of a matrix 
A, for different values of e. 

Several properties of pseudospectra are proven in [SJ §2]. One of which is that if 
A 6 then cre(A) is nonempty, open, and bounded, with at most N connected 

components, each containing one or more eigenvalues of A. This leads us to the 
following notation: 
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Notation. For A € a{A), we write o'e{A) f A to be the connected component of 
o'siA) that contains A. 


Another property, which follows straight from the definitions of pseudospectra, is 
that ne>oO'e(^) = From these properties, it follows that there is e small 

enough so that cre(A) consists of exactly |cr(A)| connected components, each an 
open set around a distinct eigenvalue. In particular, there is e small enough so that 
cr(A) ncre(A) 1 A = {A}. 

When a matrix A is the direct sum of smaller matrices, we can look at the pseu¬ 
dospectra of the smaller matrices to understand the e-pseudospectrum of A. We 
get the following theorem from [8]: 

Theorem 2.2. 

ae(Ai ® A 2 ) = o-s(Ai) u ae(A 2 ). 


2.2. Normal Matrices. Recall that a matrix A is normal if AA* = A* A, or equiv¬ 
alently if A can be diagonalized with an orthonormal basis of eigenvectors. 

The pseudospectra of these matrices are particularly well-behaved: Thm. [273| shows 
that the e-pseudospectrum of a normal matrix is exactly a disk of radius e around 
each eigenvalue, as in shown in Fig. |2.3| This is clear for diagonal matrices; it 
follows for normal matrices since as we shall see, the e-pseudospectrum of a matrix 
is invariant under a unitary change of basis. 



Figure 2.3. The e-pseudospectrum of a normal matrix. Note 
that each boundary is a perfect disk around an eigenvalue. 

Theorem 2.3. LetAeC^^^. Then, 

(2.1) a{A) + B{0,e) £ ag(A) for all e > 0. 

Furthermore, A is a normal matrix if and only if 

(2.2) ae{A) = a{A) + B{0,e) for all e > 0. 


The proof of this theorem can be found in [51 §2] 
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2.3. Non-normal Diagonalizable Matrices. Now suppose A is diagonalizable 
but not normal, i.e. we cannot diagonalize A by an isometry of C^. In this case 
we do not expect to get an exact characterization of the e-pseudospectra as we 
did previously. That is, there exist matrices with pseudospectra larger than the 
disk of radius e. Regardless, we can still characterize the behavior of non-normal, 
diagonalizable matrices. 

Theorem 2.4 (Bauer-Fike). Let A e and let A be diagonalizable, A = 

VDV~^. Then for each e > 0, 

a{A) + 5(0,e) £ crg(A) £ cr(T) + B(0,£k(V)) 

where 

<v) = \\v\\\\v-^\\ = "al!^^ 

Smin(r ) 

and Smax(R), Si„in(R) cire the maximum and minimum singular values of V, re¬ 
spectively. 

Here, k,{V) is known as the condition number of V. Note that k(V) > 1, with 
equality attained if and only if A is normal. Thus, k(V) can be thought of as a 
measure of the normality of a matrix. However, there is some ambiguity when we 
define k(V), as V is not uniquely determined. If the eigenvalues are distinct, then 
k(V) becomes unique if the eigenvectors are normalized by ||vj|| = 1. 


2.4. Non-diagonalizable Matrices. So far we have considered normal matrices, 
and more generally diagonalizable matrices. We now relax our constraint that our 
matrix be diagonalizable, and provide similar bounds on the pseudospectra. While 
not every matrix is diagonalizable, every matrix can be put in Jordan normal form. 
Below we give a brief review of the Jordan form. 

Let A 6 and suppose A has only one eigenvalue, A with geometric multiplicity 

one. Writing A in Jordan form, there exists a matrix V such that AV = VJ, where 
J is a single Jordan block of size N. Write 

V = (vi, V2, ■■■ , V„) 

Then, 

AV=[Avi, Av 2 , ... , = (Ati, Vi + Xv 2 , ... , Vn-l + XVn) = V J, 

and hence vi is a right eigenvector associated with A and V 2 ,...,Vn are generalized 
right eigenvectors, that is right eigenvectors for {A - A/)^ for k > 1. Similarly, 
there exists a matrix U such that U*A = JU*, where now the rows of U* are left 
generalized eigenvectors associated with A. 


We can also quantify the normality of an eigenvalue in the same way k{V) quantifies 
the normality of a matrix. 


Definition 2.4. For any simple eigenvalue Xj of a matrix A, the condition number 
of Xj is defined as 


KA,) = 


Ui Vi 


\u*Vj\ 


where Vj and u* are the right and left eigenvectors associated with A,-, respectively. 
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Note: The Cauchy-Schwarz inequality implies that \u*Vj\ < ||%'|| I'Cj ||, so K{Xj) > 1, 
with equality when Uj and Vj are collinear. An eigenvalue for which K{Xj) = 1 is 
called a normal eigenvalue; a matrix A with all simple eigenvalues is normal if and 
only if K{Xj) = 1 for all eigenvalues. 

With this definition, we can find finer bounds for the pseudospectrum of a matrix; in 
particular, we can find bounds for the components of the pseduospectrum centered 
around each eigenvalue. The following theorem can be found for example in [5]. 

Theorem 2.5 (Asymptotic pseudospectra inclusion regions). Suppose A € 
has N distinct eigenvalues. Then, as £ -»• 0, 

N 

cJe{A)^{jB{X„eK{X,) + 0{e^)). 
j=i 


We can drop the 0{£^) term, for which we get an increase in the radius of our 
inclusion disks by a factor of N [TJ Thm. 4]. 

Theorem 2.6 (Bauer-Fike theorem based on /c(Aj)). Suppose A € has N 

distinct eigenvalues. Then Ve > 0, 

N 

ct,{A)^UB{Xj,£Nk{Xj)). 

j=i 

The above two theorems give us upper bounds on the pseudospectra of A only when 
A has N distinct eigenvalues. These results can be generalized for matrices that do 
not have distinct eigenvalues. The following is proven in [3 §52]. 

Theorem 2.7 (Asymptotic formula for the resolvent norm). Let Xj 6 a(A) be an 
eigenvalue of with kj the size of the largest Jordan block associated to Xj. For any 
z 6 tTe(A), for small enough e, 

\z-Xj\ < (Cj£)^, 

where Cj = \\VjT^^ ^^j\\ T = J - XI. 


We extend these results by providing lower bounds for arbitrary matrices, as well 
as explicit formulas for the £-pseudospectra of 2 x 2 matrices. 


3. Pseudospectra of 2 x 2 Matrices 

The following section presents a complete characterization of the £-pseudospectrum 
of any 2x2 matrix. We classify matrices by whether they are diagonalizable or non- 
diagonalizable and determine the e-pseudospectra for each class. We begin with an 
explicit formula for computing the norm of a 2 x 2 matrix. 

, with a, b,c,de C. Let Smax denote the largest singular value of A. 
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Then, 

,,.,,2 „ Tv{A*A) + VTr(A*A)2 -Adet(A*A) 

I'J'-Lj ll^ll - Smax - ^ 

3.1. Non-diagonalizable 2x2 Matrices. Any 2x2 matrix that is non-diagonalizable 
must have exactly one eigenvalue of geometric multiplicity one. In this case, we 
can Jordan-decompose the matrix and use the first definition of pseudospectra to 
show that <Je{A) must be a perfect disk. 

Proposition 3.1. Let A be any non-diagonalizable 2x2 matrix, and let X denote 
the eigenvalue of A. Write A= V JV~^ where 

(^■^) % ■'^(o !)■ 

Given any e, 

(3.3) a,{A) = B {X,\k\) 

where 

I |2 I |2 

(3.4) |fc| = \/Ce + and C = 

\ad - bc\ 


Proof. Let z = \ + k where fc 6 C. Then we have (z - A) ^ = V(z - J) ^. 
Taking the norm, this yields 

\\M\\ 


II (^-^r 


|fc2(ad- bc)y 


where M = 


-c 


-bck + ac + adk 


From (3.1), we obtain that 

^Tr(M*M) + yTr(M*M)2-4det(M*M) 


< ll(z-A)- 


|fcp|ad- bc\\/2 


Note that this function depends only on |A:| = 1^ - A|; thus for any e, cre(A) will be 
a disk. Solving for k to find the curve bounding the pseudospectrum, we obtain 


\k\ 


\ 


|aP + |cP 

\ad - bc\ 


□ 


3.2. Diagonalizable 2x2 Matrices. Diagonalizable 2x2 matrices must have two 
distinct eigenvalues or be a multiple of the identity matrix. In either case, the 
pseudospectra can be described by the following proposition. 

Proposition 3.2. Let A be any diagonalizable 2x2 matrix and let Ai, A 2 he the 
eigenvalues of A and vi,V 2 be the eigenveetors associated with the eigenvalues. Then 
the boundary of ae(A) is the set of points z that satisfy the equation 

(3.5) (e^ - |z - Aip)(e^ - |z - A 2 P) - e^lAi - A 2 P cot^ie) = 0, 

where 9 is the angle between the two eigenvectors. 
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Proof. Since A is diagonalizable, we can write A = VDV ^ where 

^Ai 


(3.6) 


V = 


(: 2) -(o l) 


Without loss of generality, let z = Xi + k. 

Let 7 = Ai - A 2 and r = ad - be. Then, 

||(z-A)"^| = \\V{z-D)-^V-^ 

where 

M = 


\m\ 


rk{‘^ + fc)| ’ 
ad^ + rk -ab^ j 


cdj -bc^ + rkj 


Calculating Tr(M*M), we obtain 

(3.7) Tr(M*M) = \rf (I 7 + kf + \kf + 171^ cot^ 0) , 

where 9 is the angle between the two eigenvectors, which are exactly the columns 
of V. For the determinant, we have 

(3.8) det{M*M) = \r\‘^\kf\k + jf 


Plugging the above into equation (13, we get 

e-i=||(^-Al)-' 




Tr(M*M) + ^Tr(M*M)2 - 4det(M*M) 
2|rp|fc(7 + fc)p 


Re-writing and simplifying, we obtain the curve describing the boundary of the 
pseudospectrum: 

(e^ - \kf){e'^ -\k + 7 ]^) - cot^ 9 = 0. 

□ 


Note that for normal matrices, the eigenvectors are orthogonal. Therefore the 
equation above reduces to 

(3.9) 

which describes two disks of radius e centered around Ai,A 2 , as we expect. 


When the matrix only has one eigenvalue and is still diagonalizable (i.e. when it is 
a multiple of the identity), then we obtain 

{e^-\k\^f = 0, 

which is a disk of radius e centered around the eigenvalue. 


One consequence to note of Proposition 3.2 is that the shape of cr^^A) is dependent 
on both the eigenvalues and the eigenvectors of the matrix A. Another less obvious 
consequence is that the pseudospectrum of a 2 x 2 matrix approaches a union of 
disks as e tends to 0 . 
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Proposition 3.3. Let A be a diagonalizable 2x2 matrix with two distinct eigen¬ 
values, Ai,A 2 . Then, cr^(A) Ixt asymptotically tends toward a disk. In particular. 


.(AO 


.(AO 


1 + Oie), 


where rmax{\),‘rmin{\) are the maximum and minimum distances from Xi, to 
da,;(A) 1';^.. Moreover, for A diagonalizable but not normal, (XsiA) is never a 
perfect disk. 


Proof. Let e be small enough so that the c-pseudospectrum is disconnected. With¬ 
out loss of generality, we will consider as(A) Iai- 


Let Zmax 6 dag(A) such that |zmax - All is a maximum. Set rmax(A) = l^max “ A|. 
Consider the line joining Ai and A 2 . Suppose for contradiction that Zmax did not 
lie on this line. Then, rotate Zmax in the direction of A 2 so that it is on this line, 
and call this new point z'. Note that jz'-A 2 I < |^max-A 2 |, but |z'-Ai| = |zmax-Ai|. 
As such, we get that 


(io-Air-£0(k'-A2r-co<(k 


-Ai|^-e^)(|zmax-A2|^-c^) - e^|Ai-A2p cot^ 9 

Thus, from Proposition 3.2 z' 6 ag{A) but z' is not on the boundary of ag(A). 
Starting from z' and traversing the line joining Ai and A 2 , we can find z" € 9(Te(A) 
such that 1^" - All > |z' - All = l^max - Ai|. This contradicts our choice of Zmax and 
so Zmax must be on the line joining Ai and A 2 . A similar argument shows that 
^min rnust also be on this line, where z^nin ^ ^^e^Adj such that raiin “ |.^min All is a 
minimum. 


Since Zmax is on the line joining Ai and A 2 , we have the exact equality 
kmax “ A 2 I = l^^max “ Ai| + IA 2 - Ai|. 

Let y = IA 2 - Ai|. The equation describing rmax(Ai) becomes 

(Tmax(Al)^ - e^) {{y - rmax(Al))^ - £^) = COt^ 9 

Similarly, we can obtain the equation for rmi„(Ai). Solving for rmax(Ai) and 
^ min (Ai), we get 

(3.10) rinax(Ai) = ^ (^y-\/y^+ 46^-4y£csc9'j 

(3.11) rmin(Ai) = ^ (^\/y^-h 4£^ + 4y£CSC9 - y'j 


For e small, we can use the approximation (1 + e)^’ = 1 + pe + 0(£^). Then, 


(3.12) 
where n = 

' V 

we find that 

(3.13) 


rmax(Al) ^ 1 - ^1 + 4 (£/p)2 - 4(£/p) csc 6> 
^ min (Ai) + A{£jyY +4(£/j/)csc0- 1 

1 + 77£ + 0(£^) 

1 - 77£ + 0{£^) 

Using the geometric series approximation 

fmax , (2cos0cot6>)£ 2. 

- = 1 + j 

£min y 


1 + a; + 0{x^), 
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Thus, cTsIA) tends towards a disk. Moreover, if A is diagonalizable but not normal, 
then the eigenvectors are linearly independent but not orthogonal, so 6 is not a 
multiple of 7r/2 or tt, and therefore cos 0 cot 0 + 0 and 1. □ 

This result can be observed by looking at plots of the pseudospectra of diagonaliz¬ 
able 2x2 matrices. 



Figure 3.1. e-pseudospectra of a diagonalizable 2x2 matrix 


The image on the left shows the pseudospectra of a particular 2x2 matrix. One can 
see that for large enough values of e, the pseudospectra around either eigenvalue 
is not a perfect disk. The image on the right is the pseudospectra of the same 
matrix (restricted to one eigenvalue), with smaller values of epsilon. Here, the 
pseudospectra appear to converge to disks. We find that this result holds in general 
for any N x N matrix and this is proven in the following section. 


4. Asymptotic Union of Disks Theorem 

In Propositions |3.1| and |3.2| we showed that the e-pseudospectra for all 2 x 2 ma¬ 
trices are disks or asymptotically converge to a union of disks. We now explore 
whether this behavior holds in the general case. It is possible to find matrices 
whose e-pseudospectra exhibit pathological properties for large e; for example, the 
non-diagonalizable matrix given in Figure [TT] has, for larger e, an e-pseudospectrum 
that is not convex and not simply connected. 
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Figure 4.1. Pseudospectra of a Toeplitz matrix 
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Thus, pseudospectra may behave poorly for large enough e; however, in the limit as 
c ->■ 0, these properties disappear and the pseudospectra behave as disks centered 
around the eigenvalues with well-understood radii. In order to understand this 
asymptotic behavior, we will use the following set-up (which follows [7]). 

Let A 6 and fix 

J 

where J consists of Jordan blocks Ji,... ,Jm corresponding to the eigenvalue A. J 
consists of Jordan blocks corresponding to the other eigenvalues of A. 

Let n be the size of the largest Jordan block corresponding to A, and suppose there 
are i Jordan blocks corresponding to A of size nx n. Arrange the Jordan blocks in 
J in weakly decreasing order, according to size. That is, 

dim ( Ji) = ••• = dim( J^) > dim ( J^+i) > > dim ( J^) 

where Ji,..., are nxn. 


A 6 (t(A). Write the Jordan decomposition of A as such 


J 


(Q) 

{Q 


A{P P). 


<Q\ 

\Ql 


{p P) = i 


Further partition P, 

P = {Pi , • • ■ , Pi , ■■■ , Pm) 


in a way that agrees with the above partition of J, so that the first column, Xj, of 
each Pj is a right eigenvector of A associated with A. We also partition Q likewise 


(Qi\ 


Q = 


Qe 


\Qmj 


The last row, yj, of each Qj is a left eigenvector of A corresponding to A. 
We now build the matrices 


Y = 


(yi\ 

y2 

\ye/ 


X = (xi , X2 , . . . , Xl) , 


where X and Y are the matrices of right and left eigenvectors, respectively, corre¬ 
sponding to the Jordan blocks of maximal size for A. 


The following theorem is presented by Moro, Burke, and Overton [7] and due to 
Lidskii [S]. 

Theorem 4.1 (Lidskii j^). Given l,n as defined above corresponding to the matrix 
A, there are in eigenvalues of the perturbed matrix A + eE admitting a first order 
expansion 
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forj = k = 1,... ,n, where 7 j are the eigenvalues ofYEX and the different 

values of Aj^(e) for k = 1,..., n are defined by taking the distinct roots of . 

Lidskii’s result can be interpreted in terms of the e-pseudospectrum of a matrix A 
in order to understand the radii of cre(A) as e ^ 0. 

Theorem 4.2. Let A 6 Let e > 0. Given A € (t{A), for e small enough, 

there exists a connected component U £ crg{A) such that U na{A) = A; denote this 
component of the e-pseudospectrum <Je{A) Ia- 

Then, as e ^ 0, 

B{X, (Ce)!/" + £ a,(A) B{X, + o(e^/^)) 

where C = ||Xy||, with X,Y defined above, and n is the size of the largest Jordan 
block corresponding to A. 


Proof. Lower Bound: Give E 6 let 7 max(L') be the largest eigenvalue of 

YEX. It is shown [71 Theorem 4.2] that 

a := max 7 max(L^) = ||^A"||, 

|| E ||<1 

Moreover, the E that maximizes 7 is given hy E = vu where v and u are the 
right and left singular vectors of the largest singular value of XY, normalized so 
||i;|| = ||u|| = 1. We claim that i3(A, (jaje)^^" + o(e^'^")) £ cr,,(A) [x. 


Fix E = vu, with v,u defined above, fix 0 € [ 0 , 2 n 7 r], and define E = e^^E. Note 
that 7 is an eigenvalue of YEX iff e ®®7 is an eigenvalue of YEX. Since a is an 
eigenvalue of E, then an eigenvalue of YEX. Considering the perturbed 


matrix A + eE, theorem 
form 


4.1 


implies that there is a perturbed eigenvalue A(e) of the 


A(e) = A+(e*Ve)i/" + o(e^/”) 




and thus A(e) e (Te(Al) \\. Ranging 9 from 0 to 2 n 7 r, we get the desired result. 


Upper Bound: Using the proof of |S1 Theorem 52.3], we know that asymptotically 
aM) rA£ B{X,{fiefl- + o{e^l'^)), 
where fi = and J = XI + D. We claim fi = ||^U|j = a. 


Note that Z?" ^ = diag[ri,... Ff, 0] where F^ is a n x n matrix with a 1 in the top 
right entry and zeros elsewhere. We hnd 




Xi X2 


XI 


This then gives 


PD'^-^Q 


Thus fi = |lPP"-iQ|l = llXrI = a. 



□ 
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We present special cases of matrices to explore the consequences of Theorem |4.2| 
Special Cases: 

(1) A is simple. 

Then, n = 1 and X and Y become the right and left eigenvectors x and 
y* for A, respectively. Hence, C = ||Xy|| = ||a:y*|| = ||a;||||?/|| = k{X) where we 
normalize so that \y*x\ = 1. Then, Theorem 4.2 becomes 

o-e(^) 1 a~ B{X,K{X)e) 

which matches with Theorem 12.51 

(2) A has geometric multiplicity 1. 

In this case we obtain the same result for when A is simple, except n 
may not equal 1. In other words, 

ae(A) rv«H(A,(«(A)c)i/"). 

(3) H6C2X2. 

There are two cases, as in Section 

First, assume A is non-diagonalizable. In this case, A only has one eigen¬ 
value, A. Writing A = VJV~^, where V and J are as defined in equation 


(3.21, we have that, 
X={a 


4' 


Y = 


1 


ad - be 

From Theorem |4.2[ we then have that as e 

,1/2 

I I \l I \ ~ -*• \ t ’\~ ' 

a^{A) « h( A, 


(-C 

- 0 , 


i). 


’ \ \ad - bc\ ) 


+ o 


(P'=) 


This agrees asymptotically with equation |3.4| however |3.4| gives an explicit 
formula for ag{A). 

In the case where A is diagonalizable, A has two eigenvalues, Ai and A 2 . 
Again, we write A = VDV~^ where V and D are as defined in equation 
(3.6 1 . From this, we have 

lixrii = 


ap + |cp)(|&p + 


0 


= esed. 


\ad - bc\ 

Thus, as e -*■ 0, we have from Theorem |4.2| 

i? (A, {esc 9) e + o(e)) £ erg (A) £ H (A, (csc0) e + o (e)). 

So, 

Tmax _ (csc9)e + o(e) _ ^ ^ 

Tmin (csc6»)e + o(e) 

This agrees with the ratio we obtain from the explicit formula for diago¬ 
nalizable 2x2 matrices; however, equation (3.131 gives us more information 
on the 0 ( 1 ) term. 

(4) A is a Jordan block. 

From 0 pg. 470], we know that the e-pseudospectrum of the Jordan 
block is exactly a disk about the eigenvalue of J of some radius. An explicit 
formula for the radius remains unknown, however we can use Theorem |4.2| 
to find the asymptotic behavior. 
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Proposition 4.1 (Asymptotic Bound). Let J be an N x N Jordan block. 
Then 




Proof. The N x N Jordan block has left and right eigenvectors Uj and Vj 


where ||uj|| 
Wviufl = 1 . 


= 1 and 
Thus, 


= 1. So, from Theorem 


4.2 


we find C = ||Ary|| = 






□ 


By a simple computation, we can also get a better explicit lower bound 
on the e-pseudospectra of e,n N x N Jordan block, that agrees with our 
asymptotic bound. 

Proposition 4.2. Let J be an N x N Jordan block. Then, 

b ( x , V£(l + e)^-i)ca,(J). 


Proof. We use the second definition for ag{J). Let 


/O 



k 

0 


k 


\k 


k 

0 / 


where |fc| < e, and note that ||i?|| < e. We take det( J + E - zl) and set it 
equal to zero to find the eigenvalues of J + A. 


0 = det( J + E - zl) 

' X-z k + 1 
X - z 


= det 


k+1 


k 


k + \ 
X - Z) 


= (A-z)^ + (-l)^-iyfc(l + fc)^-i 

^ (z-A)^ = /fc(l + fc)^-^ 

0-A= \Jk{\ + kY-^. 


So, b[x, ^£(l + £)^-i)ca,(J). 


□ 


5. Pseudospectra of bidiagonal matrices 

In this section we consider bidiagonal matrices, a class of matrices with impor¬ 
tant applications in spectral theory and mathematical physics. We investigate the 
pseudospectra of periodic bidiagonal matrices and show that the powers n and 
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the coefficients C in Theorem |4.2| can be computed explicitly. We consider the 
coefficients and {bkj^Ji which define the bidiagonal matrix 


' ai bi 

02 62 


A = hidiag{{ak}k=i,{bk}k=i) = 


\ 


\ 


Oat-I bN-1 

Qn I 


Note that if 6 ^ = 0 for some i, then the matrix A “decouples” into the direct 
sum A = bidiag({a/c}fc^i,{5fc}*j.“\) ® hidiag({ak}^^i+i, {bk}Z~ili) and by Theorem 
|2.2| the pseudospectrum of A is the union of pseudospectra of smaller bidiagonal 
matrices. Therefore we can assume, without loss of generality, that b^ + d for any 
*€{1,2,...,7V-1}. 


Note also that the eigenvalues of A are {oi, 02 ,..., a„} and some eigenvalues may be 
repeated in the list. In order to apply Theorem |4.2| we have to find the dimension 
of the largest Jordan block associated to each eigenvalue of the matrix A. The 
following proposition addresses this question: 


Proposition 5.1. Let A = bidiag{ 6 ^,}^']^) with bi i= 0 for any i and 
suppose that a is an eigenvalue of A. Then dhiiN{A- al) = 1, where N{A - al) is 
the eigenspace corresponding to the eigenvalue a of the matrix A. 


Proof. Suppose a = = Oi^ = ... = Qi^ where 1 < ii < i 2 < ■■■ < im ^ N and a + au 

for any fc e {1,2,..., n} \ {ii, ^ 2 , ■ ■ ■, *m}- We have 


' Oi - a 61 
02 — a 


A - al = 


bii-2 

Uii-l “ O bi^-i 

0 bi^ 

G^i+i ~ d 


\ 


\ 


bN-1 
On - a j 


Let US denote by {ci, C 2 ,..., cat} the columns oi A-al and by {ei, 62 ,..., oat} 
the standard canonical basis in . Since bk i= 0 for any k 6 {l,2,...,iV-l} 
we obtain that columns {c 2 , C 3 ,..., cat} are linearly independent. Moreover, we 
also have Span(c 2 , C 3 ,..., ) = Span(ei, 82 ,..., which in turn implies that 

Cl 6 Span{c 2 , C 3 ,..., Cij}. We conclude that the rank of the matrix A- al is N- 1, 
hence dim N{A - al) = 1. □ 


The previous proposition implies that, under the assumption bi t 0 for any i, if a is 
an eigenvalue of the matrix A = bidiag of algebraic multiplicity 

m, then there is only one Jordan block associated to the eigenvalue a. 
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We now consider the special case of periodic bidiagonal matrices. Let A be an TV xA^ 
matrix with period k on the main diagonal and nonzero superdiagonal entries 


/«i &i 


\ 


A = 


bN-i 

dj- j 


We have from Theorem |4.2| and Proposition |5 .1 1 that 



where rij is the size of the Jordan block corresponding to dj and also the number of 
times dj appears on the main diagonal. Moreover, the constant Cj that multiplies 
any eigenvalue dj is simply Cj = I'Cj || ||Mj ||, where Vj and Uj denote the right and 
left eigenvectors, respectively. We will give the explicit expressions for Vj and Uj. 

We will begin by introducing £-pseudospectrum for simple special cases which lead 
to the most general case. 

The cases will be presented as follows: 

(1) Let A he a kn X kn matrix with ai,... ,dk distinct. 

(2) Let Al be an TV X TV matrix with ui,..., distinct. 

(3) General Case: Let A be an TV x TV matrix with ai,..., not distinct. 

To shorten notation for the rest of this section, we define 



Case 1: 


• The size of A is kn x kn. 

• The Oi’s are distinct. 


We write the elements of the superdiagonal as bi,b 2 ,. ■ b^-i- Let p= k{n-l) +j. 
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We have that: 

_ bi-bj.i _\ 

_ b2-"bj^i _ 

f {aj-a2)---f{aj-aj-i) 

bj-i U* = ( _ ^ _ 

1 
0 

0 y 

Direct computation will show that these are indeed left and right eigenvectors 
associated with any eigenvalue Uj. 

Case 2: 

• The size of A is x 

• The ay’s are distinct. 

We relax our assumption that the size of our matrix is kn x kn, for period k on the 
diagonal. Let n,r be such that N = kn + r, where 0 < r < k. In other words, ar is 
the last entry on the main diagonal, so the period does not necessarily complete. 

For Qj, the right eigenvector is given by 

= ( _ bi-bj-i _ _ b2-bj^i _ __ _&3-1_ , Q __ 

We split up the formula for the left eigenvectors into two cases: 

(1) 1 < j < r 

( 2 ) r < j < k. 

(1) 1 < j < r 

On the main diagonal, there are n complete blocks with entries Oi,..., a^, and one 
partial block at the end with entries oi,.. .a^. In the first case, when 1 < j < r, 
then Qj is in this last partial block. In this case then, let p = kn + j. 

We have that 

0 
0 

■ f{aj+i - aj)---f{ar - Oj) 

{bj—bp) ■ f{aj+2 - aj)---f{ar - aj) 

{bj—bN- 2 ) ■ f{ar - Oj) 
bj-"bM-i 
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where 

_ /(ai-aj)-/(aj-i-aj) _ 

[fio-i - aj)-fiak - aj)]”/(ai - aj)--f{ar - a^)' 


(2) r < j < k 

In this case, aj is in the last complete block. Now, let p = k{n - 1) + j- 


We have that 


Uj = 


{bj---bp^i) ■ f{ai - aj)---f{ar - aj)f{aj+i - aj)---f{ak - aj) 
{bj—bp) ■ /(oi - aj)—f{ar - aj)f{aj+2 - aj)—f{ak - aj) 

{bj—bp+k-j-i) ■ f{ai - aj)—f{ar - aj) 
(bj-'-bp+k-j) ■ f{a2 - aj)---f{ar - a^) 

{bj—bN-2) ■ f{ar - ttj) 
byb^-l 


again where 


Hj = 


f{ai-aj)-f{aj-i-aj) 


[/(oi - aj)—f{ak - aj)]”/(ai - aj)—f{ar - a,) 


Case 3: General Case. 


• The size of is N x 

• The Qi ’s are not distinct for 1 < i < k 

Let A be a X TV periodic bidiagonal matrix with period k on the main diagonal. 
Let n,r be such that N = kn + r, where 0 < r < fc. Write ai,... ,ak for the entries 
on the main diagonal (a^’s not distinct) and &i,...,5jv-i for the entries on the 
superdiagonal. Let be the last entry on the main diagonal. 

We can explicitly find the left and right eigenvectors for any eigenvalue, a. Sup¬ 
pose a first appears in position £ of the period k. Then the corresponding right 
eigenvector for a is the same form as Vi in case 2. That is, 

_ /__ _ b2-"bi_i _ _ _ bi^i 1 Q 

^~\f(ae-ai)---f(ae-ae-i)’ f(ai-a2)—fiae-at-i)’ ’ /(a£-a^_i)> > ’ ' ) ' 

The corresponding left eigenvector for a depends on the first and last positions of 
a. Let A:(n-1) = + s and setq = m (mod A:). We split up the formula for the left 

eigenvector of a into two cases, which again mirror the formulas given in case 2: 


(1) 1 < f' < r 

(2) r <£< k. 
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For both of these two cases, we define 


g{h) 


Ibi, i>p 
i<p 


( 1 ) l<^<r 

In this case then, a. appears in the partial block. Let p = kn + £. We have that 


ui = Pi 


0 

0 

gibp+m-^-l )/( ^m+1 - ai)—f{ar - ai) 
9(,bp+rn-£-l ')9(,bp+rn-l') f (am+2 “ “ O.ff') 


g{bp+m-e-i)"-g{bN-2)f{ar - ai) 
g{bp+m-i-i)-"g{bN-i) 


where pi = 


_ bf-bp-if(ai-ae)---f{ae~i-ae) _ 

[/(ai-Of)-'-/(ofc-af)]"/(ai-a^)---/(ar-a£) ' 


(2) r<l<k 

In this case, a is in the last complete block. Here, we let p = k{n - 1) + i. Now, we 
have 

/ 0 \ 


0 

g{bp+m-i-i)f{ai - ai)---(ar - ai)f(am+i - ai)—f{ak - ai) 
g{bp+m-i-i)g{bp+Tn-i) f{ai - ai)---f{ar - ai)f(am +2 - ai)—f{ak - ai) 

g{bp+m-i-i)—g{bp+k-£-i)f{ai - ai)—f{ar - ai) 
gibp+rn-£-l\"gibp+k-£) f (a2 “ ~ 0.^) 

g{bp+m-i-i)---g{bN-2)f{ar - ai) 

g(bp+,p,_i_i)---g(bjY-i) 


ui = pi 


where 


pe = 


{bfbp-i) ■ /(oi - ai)—f{ai-i - ai) 

[/(oi - ai)—f{ak - ai!)]"/(ai - ai)—f{ar - ai) 


From these formulas, we can find the eigenvectors, and hence the asymptotic be¬ 
havior of the £-pseudospectrum for any bidiagonal matrix, 

j=i \ 

where Cj = ||Pj||||Mj || and nj is the size of the Jordan block corresponding to aj. 

Note: Let H be a periodic, bidiagonal matrix and suppose b^ = 0 for some i. Then 
the matrix decouples into the direct sum of smaller matrices, call them Hi,..., H„. 
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To find the e-pseudospectrum of A, apply the same analysis to these smaller ma¬ 
trices, and from Theorem |2.2[ we have that 

n 

(^eiA) = [Jae{Ai). 

i=l 


6. Finite Rank operators 

The majority of this paper has focused on both explicit and asymptotic characteri¬ 
zations of e-pseudospectra for various classes of finite dimensional linear operators. 
A natural next step is to consider finite rank operators on an infinite dimensional 
space. 

In section we defined e-pseudospectra for matrices, although our definitions are 
exactly the same in the infinite dimensional case. For our purposes, the only note¬ 
worthy difference between matrices and operators is that the spectrum of an oper¬ 
ator is no longer defined as the collection of eigenvalues, but rather 

cr{A) = {A I A/ - A does not have a bounded inverse} 

As a result, we do not get the same properties for pseudospectra as we did previ¬ 
ously; in particular, cre(A) is not necessarily bounded. 

That being said, the following theorem shows that finite rank operators behave sim¬ 
ilarly to matrices, in that asymptotically the radii of e-pseudospectra are bounded 
by powers of epsilon. The following theorem makes this precise. 

Theorem 6.1. Let V be a Hilbert space and A : V ^ V a finite rank operator on 
H. Then there exists C such that for sufficiently small e, 

CTe(A) £ cr(A) + R(0, Ce^+i). 

where m is the rank of A. Furthermore, this bound is sharp in the sense that there 
exists a rank-m operator A and a constant c such that 

CT£(A) 3 cr(A) + B{0, ce ^^) 

for sufficiently small e. 


Proof. Since A has finite rank, there exists a finite dimensional subspace U such 
that V = U ®W and A{U) £ U, and A(W) = {0}. Choosing an orthonormal basis 
for A which respects this decomposition we can write A = A'®0. Then the spectrum 
of A is cr(A') u {0}, and we know that for any e, 

(7e(A) = ae(A')uae(0). 

The e-pseudospectrum of the zero operator is well-understood since this operator 
is normal; for any e, it is precisely the ball of radius e. It thus suffices to consider 
the £-pseudospectrum of the finite rank operator A' ■■ U U, where U is finite 
dimensional. The e-pseudospectrum of this operator goes like where j is the 
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dimension of the largest Jordan block; we will prove that j <m + l. Note that the 
rank of the nx n Jordan block given by 


/A 1 0 0 ... \ 

0 A 1 0 ... 

: : : ^n-1 

\0 0 0 0 A / 


is n if A + 0, and n - 1 if A = 0. Since we know that the rank of A is larger than 
or equal to the rank of the largest Jordan block, we have an upper bound on the 
dimension of the largest Jordan block: it is of size m + 1, with equality attained 
when A = 0. By Thm. 4.2 we then know that o'i-(A) is contained, for small enough 
e, in the set a{A) + Cesur. 


Note that this bound is sharp; we can see this by taking V to be and consid¬ 
ering the rank-m operator 

/O 1 0 0 ...\ 

0 0 1 0 ... 

Am — • • '• '. . . . 

1 

\o 0 0 0 0 / 

the pseudospectrum of which will contain the ball of radius e™+i by proposition 
1421 □ 


Open Questions. 


The natural question to ask now is whether we can extend this result to more 
arbitrary operators on Hilbert spaces. In particular, for a bounded operator A, 
we would like to establish if there exists a continuous function r^(e) such that for 
sufficiently small s, 


For a matrix A, we proved in Thm. 


cre(A) £ <t(A) + B(0,rA(e)). 

that r^Ce) = Ce^!^ 


4.2 


where n is the size 


of the largest Jordan block associated to A, and C is a constant that depends on 
the left and right eigenvectors assoc iated to a certain eigenvalue. For a finite rank 
operator A, we proved in Thm. 


operator and C is as above. 


6.1 


that rA(s) = Ce^+i, where m is the rank of the 


For closed but not necessarily bounded operators, the picture is more complex, as 
the spectrum need not be bounded or even non-empty. For example, the operator 
A : u u' in L^[0,1] with domain D{A) being the set of absolutely continuous 
functions on [0,1] satisfying u(l) = 0 has empty spectrum. With D{A) being the 
entire space, then the spectrum of A is the entire complex plane. Davies [3] also 
provides an example of an unbounded operator with unbounded pseudospectrum. 

Given these examples, we can see that Thm. |6.1| will not generalize to unbounded 
operators, as the pseudospectrum of an unbounded operator may be unbounded 
for all e. 
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Nonetheless, we do still have a certain convergence of the e-pseudospectrum to the 
spectrum [SI §4], namely ng>o cr^(A) = cr{A). Also, while the e-pseudospectrum 
may be unbounded, any bounded component of it necessarily contains a compo¬ 
nent of the spectrum. These results imply that the bounded components of the 
e-pseudospectrum must converge to the spectrum. Therefore, if we restrict our at¬ 
tention to these bounded components, we can attempt to generalize Thms. |4.2|and 


6.1 by asking whether the bounded components of o'g(A) converge to the spectrum 


as a union of disks. 
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